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ACTIVE LOG READ I/O BALANCING FOR LOG DUPLEXING 

FIELD OF THE INVENTION 

5 

This invention relates in general to computer- 
implemented database systems, and, in particular, to. 
use of dual copies of logs . 

10 BACKGROUND OF THE INVENTION 

Databases are computerized information storage and 
retrieval systems. A Relational Database Management 
System (RDBMS) is a database management system (DBMS) 

15 which uses relational techniques for storing and 

retrieving data. Relational databases are organized - 
into tables which consist of rows and columns of data. 
The rows are formally called tuples. A database will 
typically have many tables and each table will 

20 typically have multiple tuples and multiple columns . 
The tables are typically stored on direct access 
storage devices (DASD) , such as magnetic or optical 
disk drives for semi -permanent storage. 

25 A table is assigned to a tablespace. The 

tablespace contains one or more datasets. In this way, 
the data from a table is assigned to physical storage . 
on DASD. Each tablespace is physically divided into 
equal units called pages. The size of the tablespace ' s 

30 pages is based on the page size of the buffer pool 

specified in the tablespace ' s creation statement. The 
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bufferpool is an area of virtual storage that is used 
to store data temporarily. A tablespace can be 
partitioned, in which case a table may be divided among 
the tablespace 's partitions, with each partition stored 
5 as a separate dataset , Partitions are typically used 
for very large tables. 

RDBMS software using a Structured Query Language 
(SQL), interface is well known in the art. The SQL 

10 interface has evolved into a standard language for 

RDBMS software and has been adopted as such by both the 
American National Standards Institute (ANSI) and the 
International Standards Organization (ISO) . The SQL 
interface allows users to formulate relational 

15 operations on the tables either interactively,, in batch 
files, or embedded in host languages, such as C and 
COBOL. SQL allows the user to modify the data and as 
the data is being modified, all operations on the data 
are logged in a log file. 

20 

When a primary copy of a database is corrupted, 
for example, due to a system or media failure, or when 
a user wants to remove modifications to the data (i.e., 
back out the changes) , the primary copy of the database 
25 is recovered by applying changes found in the log to a 
secondary copy of the database. That is, through use 
of the log file, the copied data is modified, based on 
the operations and data found in the log file. 

30 Typically, for database systems, it is recommended 

that dual copies (duplex) of the log be maintained to 
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prevent a single pointr-of -failure . The recommendation 
is to maintain the secondary copy of the log on a 
separate I/O path from the primary copy of the log. In 
the .event of a failure reading the primary copy of the 
5 log, the secondary copy of the log is used to satisfy 
the request - 

Normally, all log read requests are directed to 
the primary copy of the log, reserving the secondary 

10 copy of the log for error recovery. However, in the 
case where a system failure has occurred, but the 
primary copy of the log remains accessible, a number of 
recovery tasks often run in parallel. Accordingly, it 
often occurs that there are a number of substantially 

15 concurrent requests to the log. This action results in 
a queuing of the requests and a delay in the recovery- 
act ion. 

It is therefore an object of this invention to 
20 remove a system recovery bottleneck that is presented 
when too many requests to the log are queued. 

It is another object of the invention to provide a 
method for more efficient use of the secondary copy of 
25 the log. 

SUMMARY OF THE INVENTION 

The invention is embodied in a method for enabling 
improyed^^access to data stored in a log of a computer j 
30 memory system, wherein the computer memory system has 
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multiple_copies of said log, i.e., a primary log and a 
secondary log. Each log stores data transactions with 
a database system stored on the computer memory system. 
The method initially responds to a process request for 
5 access to a log, by determining a parameter indicative 
of demand for access to one of the copies of the log. 
If the parameter has reached a threshold value, 
accessing processes are thereafter distributed between 
the primary and secondary copies of the log so as to 
10 balance the work dispatched to the respective log 

copies . The invention is implemented by a computer 
system and may also be incorporated into a memory media 
device such as a magnetic disk. 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

Referring now to the drawings in which like 
reference numbers represent corresponding parts 
throughout : 

20 

FIG 1 illustrates an exemplary computer hardware 
environment that implements the present invention. 

Fig. 2 is a high level flow diagram that 
25 illustrates the method of the invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

In the following description of the preferred 
embodiment, reference is made to the accompanying 
5 drawings which form a part hereof, and in which is 

shown by way of illustration, a specific embodiment in 
which the invention may be practiced. It is to be 
understood that other embodiments may be utilized 
without departing from the scope of the present 
10 invention. 

Hardware Environment 

FIG- 1 illustrates an exemplary computer hardware 
environment that implements the present invention. 

15 Computer system 102 is comprised of one or more 
processors connected to one or more data storage 
devices 104 and 106 that store one or more relational 
databases, such as a fixed or hard disk drive, a floppy 
disk drive, a CDROM drive, a tape drive, or other 

20 device. 

Operators of computer system 102 use a standard 
operator interface 108, such as IMS/DB/DC*, CICS*, TSO*, 
OS/3 90*, ODBC* or other similar interface, to transmit 
electrical signals to and from computer system 102 that 

25 represent commands for performing various search and 
retrieval functions, termed queries, against the 
databases. In the present invention, these queries 
conform to the Structured Query Language (SQL) 
standard, and invoke functions performed by Relational 

30 DataBase Management System (RDBMS) software. 
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The SQL interface allows users to formulate 
relational operations on the tables either 
interactively, in batch files, or embedded in host 
5 languages, such as C and COBOL. SQL allows the user to 
modify the data. In the preferred embodiment of the 
present invention, the RDBMS software comprises the DB2* 
product offered by IBM for UDB ("Universal Database") 
for the OS/3 90* operating system. Those skilled in the 
10 art will recognize, however, that the present invention 
has application to any RDBMS software, whether or not 
the RDBMS software uses SQL. 

As illustrated in FIG. 1, the DB2* system for UDB 
15 for the OS/3 90* operating system includes three major 
components: Internal Resource Lock Manager (IRLM) 110, 
Systems Services module 112 and Database Services 
module. IRLM 110 handles locking services for the DB2* 
system, which treats data as a shared resource, thereby 
20 allowing any number of users to access the same data 
simultaneously. Thus concurrency control is required 
to isolate users and to maintain data integrity. 

Systems Services module 112 controls the overall 
25 DB2* execution environment, including managing log data 
sets 106, gathering statistics, handling startup and 
shutdown, and providing management support. At the 
center of the.DB2* system is Database Services module 
118 that contains plural sxibmodules, including 
30 Relational Database System (RDS) 116, Data Manager 118, 
Buffer Manager 120, Logging System 122 and other 
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components 114 such as an SQL compiler/interpreter . 
These submodules support the fiinctions of the SQL 
language, i.e. definition, access control, 
interpretation, compilation, database retrieval, and 
5 update of user and system data. 

The present invention is generally implemented 
using SQL statements executed under the control of 
Database Services module 118 , Database Services module 

10 118 retrieves or receives the SQL statements, wherein 
the SQL statements are generally stored in a text file 
on data storage devices 104 and 106 or are 
interactively entered into computer system 102 by an 
operator at monitor 124 via operator interface 108 . 

15 Database Services module 118 then derives or 

synthesizes instructions from the SQL statements for 
execution by computer system 102. 

Generally, the RDBMS software, the SQL statements, 
and the instructions derived therefrom, are all 
tangibly embodied in a computer- readable medium, e.g. 
one or more of the data storage devices 104 and 106 . 
Moreover, the RDBMS software, the SQL statements, and 
the instructions derived therefrom, are all comprised 
of instructions which, when read and executed by 
computer system 102, cause computer system 102 to 
perform the steps necessary to implement and/or use the 
present invention. Under control of an operating 
system, the RDBMS software, the SQL statements, and the 
instructions derived therefrom, may be loaded from data 
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storage devices 104 and 106 into a memory of computer 
system 102 for use during operations. 

Thus, the present invention may be implemented as 
5 a method, apparatus, or article of manufacture using 
standard programming and/or engineering techniques to 
produce software, firmware, hardware, or any 
combination thereof. The term "article of manufacture" 
(or alterTiatively, "computer program product" or 

10 ^'memory media") as used herein is intended to encompass 
a computer program accessible from any computer- 
readable device, carrier, or media. Of course, those 
skilled in the art will recognize many modifications 
may be made to this configuration without departing 

15 from the scope of the present invention. 



Active Log I/O Balancing for Log Duplexing 

The present invention provides a logging system 
20 122 for reducing I/O device contention for the primary 
copy of a log. The invention reduces the time required 
to recover from media or system failures . 

In some recovery scenarios (e.g., recovering from 
25 a database media failure) , there may be a number of 

processes concurrently reading from the primary copy of 
the log. This may generate I/O contention for the 
device (s) containing the primary copy of the log. As 
indicated above, this will extend the recovery period. 

30 
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Logging system 122 relieves some of the device 
contention by satisfying a portion of the log read 
requests from the secondary copy of the log in periods 
of concurrent log read activity. In the embodiment 
5 illustrated in the logic flow diagram of Fig, 2, 
logging system 122 maintains a count of active 
processes reading the log (step 200) . If the count is 
below a threshold count, logging system 122 assigns a 
preference for the primary copy of the log to new 
10 processes requesting data from the log (step 2 02) . 

If the threshold count is reached, logging system 
122 distributes preference assignments between the 
primary copy of the log and the secondary copy of the 

15 log, in respect of new processes requesting data from 

the log (step 204) . It is preferred that the preference 
assignments be alternated as between the log copies so 
as to balance the work of the respective logs and 
reduce possible delay times. For a process assigned a 

20 preference for the respective copy of the log, log read 
requests are satisfied from the respective copy of the 
log (step 206) . This action reduces queuing time and 
quickens system recovery time. 

25 The foregoing description of the preferred 

embodiment of the invention has been presented for the 
purposes of illustration and description. It is not 
intended to be exhaustive or to limit the invention to 
the precise form disclosed. Many modifications and 

30 variations are possible in light of the above teaching. 
For example, instead of keeping count of processes 
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attempting to access the primary log, a count of 
requests that have been queued to the primary log may 
be used. It is intended that the scope of the 
invention be limited not by this detailed description, 
5 but rather by the claims appended hereto. 
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